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Further to our previous conversation, I am faxing you a copy of the English translation of 
Priority Document No. 9-323129 filed on November 25, 1997. Please call me at (617) 227- 
7400 Ext. 251, if I can be of further assistance. 

Sincerely, 


Cynthia L. Kanik, Ph.D. 
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Document Name: Specification 

Title of the Invention: Human Proteins Having 

Transmembrane Domains and DNAs Encoding These Proteins 
Claim(s) : 

5 1, A protein comprising any one of the amino 

acid sequences represented by Sequence Nos. 1 to 3. 

2 • A DNA coding for the protein according to 

Claim 1 . 

3. A cDNA comprising any one of the base 
10 sequences represented by Sequence Nos. 4 to 6. 

4- The cDNA according to Claim 3 consisting of 
any one of the base sequences represented by Sequence Nos. 
7 to 9. 

5. An expression vector capable of expressing 
15 the DNA according to any one of Claims 2 to 4 by in vitro 

translation or in eucaryotic cells . 

6. A transformed eucaryotic cell capable of 
expressing the DNA according to any one of Claims 2 to 4 
and of producing the protein according to Claim 1, 

20 Detailed Explanation of the Invention; 
[0001] 
Art Field Related: 

The present invention relates to human proteins 
having transmembrane domains, cDNAs coding for these 
25 proteins, and expression vectors of said cDNAs as well as 
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eucaryotic cells expressing said cDNAs . The proteins of the 
present invention can be employed as pharmaceuticals or as 
antigens for preparing antibodies against said proteins. 
The human cDNAs of the present invention can be utilized as 
5 probes for the gene diagnosis and gene sources for the gene 
therapy. Furthermore, the cDNAs can be utilized as gene 
sources for large-scale production of Che proteins encoded 
by said cDNAs , Cells, wherein these membrane protein genes 
are introduced and membrane proteins are expressed in large 
10 amounts, can be utilized for detection of the corresponding 
ligands, screening of novel low-molecular pharmaceuticals, 
and so on. 

[0002] 

Prior Art; 

15 Membrane proteins play important roles, as signal 

receptors, ion channels, transporters, etc. in the material 
transportation and the information transmission which are 
mediated by the cell membrane. Examples thereof include 
receptors for a variety of cytokines, ion channels for the 

20 sodium ion, the potassium ion, the chloride ion, etc., 
transporters for saccharides and amino acids, and so on, 
where the genes of many of them have been cloned already. 
[0003J 

It has been clarified that abnormalities of these 
25 membrane proteins are associated with a number of hitherto- 
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cryptogenic diseases. For instance, a gene of a membrane 
protein having twelve transmembrane domains was identified 
as the gene responsible for cystic fibrosis [Roramens, J. M, 
et al., Science 245; 1059-1065 (1989)]- In addition, it has 
5 been clarified that several membrane proteins act as 
receptors when a virus infects the cells. For instance, 
HIV-l is revealed to infect into the cells through 
mediation of a membrane protein on the T-cell membrane, a 
CD-4 antigen, and a membrane protein having seven 

10 transmembrane domains, fusin [Feng, Y. et al., Science 272: 
872-877 (1996)], Therefore, discovery of a new membrane 
protein is anticipated to lead to elucidation of the causes 
of many diseases, so that isolation of a new gene coding 
for the membrane protein has been desired. 

15 [0004] 

Heretofore/ owing to difficulty in the 
purification, many membrane proteins have been isolated by 
an approach from the gene side. A general method is the so- 
called expression cloning which comprises transfection of a 

20 cDNA library in eucaryotic cells to express cDNAs and then 
detection of the cells expressing the target membrane 
protein on the membrane by an immunological technique using 
an antibody or a physiological technique on the change in 
the membrane permeability. However, this method is 

25 applicable only to cloning of a gene of a membrane protein 
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with a known function. 
[0005] 

In general, membrane proteins possess hydrophobic 
transmembrane domains inside the proteins, wherein, after 
5 synthesis thereof in the ribosome, these domains remain m 
the phospholipid membrane to be trapped in the membrane. 
Accordingly, the evidence of the cDNA for encoding the 
membrane protein is provided by determination of the whole 
base sequence of a full-length cDNA followed by detection 
10 of highly hydrophobic transmembrane domains in the amino 
acid sequence of the protein encoded by said cDNA, 
[0006] 

Problems to be Solved by the Invention: 

The object of the present invention is to provide 
15 novel human proteins having transmembrane domains, DNAs 
coding for said proteins, and expression vectors of said 
cDNAs as well as transformed eucaryotic cells that are 
capable of expressing said cDNAs. 
[0007] 

20 Means to Solve the Problems: 

As the result of intensive studies, the present 
inventors have been successful in cloning of cDNAs coding 
for proteins having transmembrane domains from the human 
full-length cDNA bank, thereby completing the present 

25 invention. In other words, the present invention provides 
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human proteins having transmembrane domains, namely 
proteins containing any of the amino acid sequences 
represented by Sequence Nos. 1 to 3, Moreover, the present 
invention provides DNAs coding for the above-mentioned 
5 proteins, exemplified by cDNAs containing any of the base 
sequences represented by Sequence Nos. 4 to 9 as well as 
transformed eucaryotic cells that are capable of expressing 
said cDNAs. 

[0008] 

10 Mode for Carrying out the Invention 

The proteins of the present invention can be 
obtained, for example, by a method for isolation from human 
organs, cell lines, etc., a method for preparation of 
peptides by the chemical synthesis based on the amino acid 

15 sequences of the present invention, or a method for 
production with the recombinant DNA technology using the 
DNAs coding for the transmembrane domains of the present 
invention, wherein the method for obtainment by the 
recombinant DNA technology is employed preferably. For 

20 instance, in vitro expression of the proteins can be 
achieved by preparation of an RNA by in vitro transcription 
from a vector having one of cDNAs of the present invention, 
followed by in vitro translation using this RNA as a 
templare. Also, recombination of the translation region 

2 5 into a suitable expression vector by the method known in 
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the art leads to production of a large amount of the 
encoded protein by using prokaryotic cells such as 
Escherichia colx, Bacillus subtilis, etc., and eucaryotic 
cells such as yeasts, insect cells, mammalian cells, etc. 
5 [0009] 

In the case in which a protein of the present 
invention is produced by expression of one of the DNAs by 
in vitro translation, recombination of the translation 
region in said cDNA into a vector having an RNA polymerase 

10 promoter, followed by addition into an in vitro translation 
system such as a rabbit reticulocyte lysate, a wheat germ 
extract or the like, which contains an RNA polymerase 
corresponding to the promoter, allows in vitro production 
of the protein of the present invention. Examples of the 

15 RNA polymerase promoter include T7, T3, Sp6, and so on. 
Vectors containing such an RNA polymerase promoter are 
exemplified by pKAl, pCDM8, pT3/T7 18, pT7/3 19, 
pBluescript II, and so on. Also, addition of the dog 
pancreas microsome etc, in the reaction system enables the 

20 membrane protein of the present invention to be expressed 
m a form integrated in the microsome membrane. 
[0010] 

In the case in which a protein of the present 
invention is produced by expression of a DNA in a 
25 microorganism such as Escherichia coli etc., a recombinant 
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expression vector bearing the translation region in the 
cDNA of the present invention xs constructed in an 
expression vector having an origin, a promoter, a ribosome- 
bindmg site, a cDNA-cioning site, a terminator etc., which 
5 can be replicated in the microorganism, and, after 
transformation of the host cells with said expression 
vector, the thus-obtained transformant is incubated, 
whereby the protein encoded by said cPNA can be produced on 
a large scale in the microorganism- In this case, a protein 

10 fragment containing an optional region can be obtained by 
carrying out the expression with inserting an initiation 
codon and a termination codon in front of and behind an 
optional translation region. Alternatively, a fusion 
protein with another protein can be expressed. Only a 

15 protein portion encoded by said cDNA can be obtained by 
cleavage of said fusion protein with a suitable protease - 
Examples of the expression vector for Escherichia coll 
include the pUC system, pBluescript II, the p£T expression 
system, the pG£X expression system, and so on. 

20 [0011] 

In the case in which one of the proteins of the 
present invention is produced by expression of a DNA m 
eucaryotic cells, the protein of the present invention can 
be produced as a membrane protein on the cell -membrane 

25 surface, when the translation region of said cDNA is 
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subjected to recombination to an expression vector for 
eucaryotic cells that has a promoter, a splicing region, a 
poly (A) insertion site, etc., followed by introduction into 
the eucaryotic cells. The expression vector is exemplified 
5 by pKAl, pCDMS, pSVK3, pMSG, pSVL, pBft-CMV, pBK-RSV, EBV 
vector, pRS, pYES2, and so on. Examples of eucaryotic cells 
to be used in general include mammalian culture cells such 
as simian kidney cells COS7, Chinese hamster ovary cells 
CHO, etc., budding yeasts, fission yeasts, silkworm cells, 

10 Xenopus laevis egg cells, and so on, but any eucaryotic 
cells may be used, provided that they are capable of 
expressing the present proteins on the membrane surface. 
The expression vector can be introduced in the eucaryotic 
cells by methods known in the art such as the 

15 electroporation method, the calcium phosphate method, the 
liposome method, the DEAE^dextran method, and so on. 
[0012] 

After one of the proteins of the present 
invention is expressed in prokaryotic cells or eucaryotic 

20 cells, the objective protein can be isolated from the 
culture and purified by a combination of separation 
procedures known in the art. Such examples include 
treatment with a denaturing agent such as urea or a 
surface-active agent, sonication, enzymatic digestion, 

25 salting-out or solvent precipitation, dialysis, 
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9 

centnfugation, ulcraf iltration, gei filtration, SPS-PAGE, 
isoelectric focusing, ion-exchange chromatography, 
hydrophobic chromatography, affinity chromatography, 
reverse phase chromatography, and so on* 

5 [0013] 

The proteins of the present invention include 
peptide fragments (5 amino acid residues or more) 
containing any partial amino acid sequence in the amino 
acid sequences represented by Sequence Nos. 1 to 3. These 

10 peptide fragments can be utilized as antigens for 
preparation of antibodies- Hereupon, among the proteins of 
the present invention, those having the signal sequence are 
secreted in the form of maturation proteins on the surface 
of the cells, after v the signal sequences are removed. 

15 Therefore, these maturation proteins shall come within the 
scope of the protein of the present invention. The N- 
terminal amino acid sequences of the maturation proteins 
can be easily identified by using the method for the 
cleavage-site determination m a signal sequence [Japanese 

20 Patent Kokai Publication No. 1996-187100], Furthermore, 
some membrane proteins undergo the processing on the cell 
surface co be converted to the secretory forms. Such 
proteins or peptides in the secretory forms shall come 
within the scope of the protein of the present invention. 

25 When sugar chain-binding sites are present in the amino 
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acid sequences, expression in appropriate eucaryotxc cells 
affords proteins wherein sugar chains are added. 
Accordingly, such proteins or peptides wherein sugar chains 
are added shall come within the scope of the protein of the 
5 present invention - 
[0014] 

The DNAs of the present invention include all 
DNAs coding for the above-mentioned proteins. Said DNAs can 
be obtained by using a method by chemical synthesis, a 
10 method by cDNA cloning, and so on, 
[00153 

The cDNAs of the present invention can be cloned, 
for example, from cDNA libraries of the human cell origin. < 
These cDNA are synthesized by using as templates poly (A) T 

15 RNAs extracted from human cells. The human cells may be 
cells delivered from the human body, for example, by the 
operation or may be the culture cells. The cDNAs can be 
synthesized by using any method selected from the Okayama- 
Berg method [okayama, H- and Berg, P. , Mol. Cell. Biol. 2: 

20 161-170 (1982) ]f the Gubler-Hof fman method [Gubler, U. and 
Hoffman, J. Gene 25: 263-269 (1983)], and so on, but it is 
preferred to use the capping method [Kato, S. et al., Gene 
163: 193-196 (1995)], as exemplified in Examples, in order 
to obtain a full-length clone in an effective manner. In 

25 addition, commercially available, human cDNA libraries can 
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be utilized. Cloning of the cDNAs of the present invention 
from the cDNA libraries can be carried out by synthesis of 
an oligonucleotide on the basis of an optional portion in 
the cDNA base sequences of the present invention/ followed 
5 by screening using this oligonucleotide as the probe 
according to the colony or plaque hybridization by a method 
known in the art. In addition, the cDNA fragments of the 
present invention can be prepared by synthesis of an 
oligonucleotide to be hybridized at both termini of the 
10 objective cDNA fragment, followed by the usage of this 
oligonucleotide as the primer for the RT-pCR method from an 
mRNA isolated from human cells. 
[0016] 

The cDNAs of the present invention are 
15 characterized by containing either of the base sequences 
represented by Sequence Nos. 4 to 6 or the base sequences 
represented by Sequence Nos. 7 to 9. Table 1 summarizes the 
clone number (HP number) , the cells affording the cDNA 
clone, the total base number of the cDNA, and the number of 
20 the ammo acid residues of the encoded protein, for each of 
the cDNAs. 
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[0017] 

[Table 1] 
Table 1 


5 

Sequence No. 

HP No. 

Cell Number of 
bases 

Number of 
amino acids 


1, 4/ 7 

HP01207 

Stomach Cancer 

2938 

269 

10 

2, 5, 8 

HF01862 

Stomach Cancer 

2290 

311 


3, 6, 9 

BP10493 

PMA-U937 

3705 

383 


[0018] 

15 Hereupon, the same clones as the cONAs of the 

present invention can be easily obtained by screening of 
the cDNA libraries constructed from the human cell lines 
and human tissues utilized in the present invention by the 
use of an oligonucleotide probe synthesized on the basis of 

20 the cDNA base sequence described in any of Sequence Nos. 4 
to 9. 

[0019] 

In general, the polymorphism due to the 
individual difference is frequently observed in human genes, 
25 Accordingly, any cDNA that is subjected to insertion or 
deletion of one or plural nucleotides and/or substitution 
with other nucleotides in Sequence Nos. 4 to 9 shall come 
within the scope of the present invention. 
[0020] 


Received from < 6177424214 > at 4/26102 1 1 :41 :44 AM [Eastern Daylight Time] 


Apr-26-02 10:27 From-LAHIVE & COCKFIELD, LLP 6177424214 T-604 P. 18/64 F-964 

13 

In a similar manner, any protein that is formed 
by these modifications comprising insertion or deletion of 
one or plural amino acids and/or substitution with other 
amino acids shall come within the scope of the present 
5 invention, as far as the protein possesses the activity of 
any protein having the amino acid sequences represented by 
Sequence Nos, 1 to 3. 
[0021] 

The cDNAs of the present invention include cDNA 
10 fragments (10 bp or more} containing any partial base 
sequence in the base sequences represented by Sequence Nos- 
4 to 6 or in the base sequences represented by Sequences No, 
7 to 9. Also, DNA fragments consisting of a sense chain and 
an anti-sense chain shall come within this scope. These DNA 
15 fragments can be utilized as the probes for the gene 
diagnosis. 

[0022] 

Examples 

The present invention is embodied in more' detail 
20 by the following examples, but this embodiment is not 
intended to restrict the present invention. The basic 
operations and the enzyme reactions with regard to the DNA 
recombination are carried out according to the literature 
[ "Molecular Cloning- A Laboratory Manual" , Cold Spring 
25 Harbor Laboratory, 1989] . Unless otherwise stated, 
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restrictive enzymes and a variety of modification enzymes 
to be used were those available from TAKARA SHUZO, The 
manufacturer's instructions were used for the buffer 
compositions as well as for the reaction conditions, in 
5 each of the en2yme reactions. The cDNA synthesis was 
carried out according to the literature [Kato, S. et al,, 
Gene 150: 243-250 (1994)]. 
[0023] 

(1) Preparation of Poly (A) T RNA 

IQ The histiocyte lymphoma cell line U937 (ATCC CRL 

1593) stimulated by phorbol ester and tissues of stomach 
cancer delivered by the operation were used for human cells 
to extract mRNAs. The cell line was incubated by a 
conventional procedure, 

15 [00241 

After about 1 g of the human cells was 
homogenized in 20 ml of a 5.5 M guanidium thipcyanate 
solution, a total mRNA was prepared according to the 
literature [Okayama, H. et al., ^Method in En^ymology" , Vol. 

20 164, Academic Press, 1987]. This was subjected to 
chromatography on oligo <dT) -cellulose column washed with a 
20 *uM Tris-hydrochloride buffer solution (pH 7.6), 0.5 M 
NaCl, and 1 mM EDTA to obtain a poly (A) T RNA according to 
the above-described literature. 

25 [0025] 
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(2) Construction of cDNA Library 

Ten micrograms of the above-mentioned paly (A) r 
RNA were dissolved in a 100 mM Tris-hydrochloride buffer 
solution (pH 8), one unit of an RNase-free, bacterial 
5 alkaline phosphatase was added, and the reaction was run at 
37 V for one hour. After the reaction solution was 
subjected to phenol extraction, followed by ethanol 
precipitation, the resulting pellet was dissolved in a 
solution containing 50 mM sodium acetate (pH 6), 1 mM £DTA/ 
10 0.1% 2-mercaptoethanol, and 0.01% Triton X-100. Thereto was 
added one unit of a tobacco-origin acid pyrophosphatase 
(Epicentre Technologies) and a total 100 ^1 volume of the 
resulting mixture was reacted at 37 X: for one hour. After 
the reaction solution was subjected to phenol extraction, 
15 followed by ethanol precipitation, the resulting pellet was 
dissolved in water to obtain a solution of a decapped 
poly (A) T RNA. 

[0026] 

The decapped poly (A) + RNA and 3 nmol of a 
20 chimeric DNA-RNA oligonucleotide (5' -dG-dG-dG-dG~dA-dA-dT- 
dT-dC-dG-dA-G-G-A-3' ) were dissolved in an aqueous solution 
containing 50 mM Tris-hydrochloride buffer solution (pH 
7.5), 0.5 mM ATP, 5 mM MgCl 2 , 10 mM 2-mercaptoethanol, and 
25% polyethylene glycol, whereto was added 50 units of T4 
25 RNA ligase and a total 30 \xl volume of the resulting 
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mixture was reacted at 20 X: for 12 hours. After the 
reaction solution was subjected to pnenol extraction/ 
followed by ethanol precipitation, the resulting pellet was 
dissolved in water to obtain a chimerxc-oligo-capped 

5 poly [A.) RNA, 

[0027] 

After digestion of vector pKAl (Japanese Patent 
Kokai Publicarion No. 1992-117292) developed by the present 
inventors with Kpnl, about 60 dT tails were added using a 
10 terminal transferase. A vector primer to be used below was 
prepared by digestion of this product with EcoRV to remove 
a dT tail at one side, 
[0028] 

After 6 |ig of the previously-prepared chimeric- 
15 oligo-capped poly (A) + RNA was annealed with 1.2 ng of the 
vector primer, the resulting product was dissolved in a 
solution containing 50 mM Tris-hydrochloride buffer 
solution <pH 8-3), 75 mM KC1, 3 mM MgCl., 10 mM 
dithiothreitol, and 1.25 mM dNTP (dATP + dCTP + dGTP + 
20 dTTP) , 200 units of a reverse transcriptase (GIBCO-BRL) 
were added, and the reaction in a total 20 \il volume was 
run at 42*C for one hour. After the reaction solution was 
subjected to phenol extraction, followed by ethanol 
precipitation, the resulting pellet was dissolved in a 
25 solution containing 50 mM Tris-hydrochloride buffer 
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solution (pH 7.5), 100 mM NaCl, 10 mM MgCl s , and 1 mM 
dithiothreitol. Thereto were added 100 units of EcoRI and a 
total 20 \xl volume of the resulting mixture was reacted at 
37 <C for one hour- After the reaction solution was 
5 subjected to phenol extraction/ followed by ethanol 
precipitation, the resulting pellet was dissolved in a 
solution containing 20 mM Tns-hydrochlonde buffer 
solution <pH 7.5) f 100 mM KC1, 4 mM MqCl a , 10 mM (NHJ a SO„ 
and 50 \iq/m\ of the bovine serum albumin. Thereto were 

10 added 60 units of an Escherichia coli DNA ligase and the 
resulting mixture was reacted at 16TC for 16 hours. To the 
reaction solution were added 2 \xl of 2 W dNTP, 4 units of 
Escherichia coli DNA polymerase I, and 0.1 unit of 
Escherichia coli RNase H and the resulting mixture was 

15 reacted at 12*0 for one hour and then at 22t for one hour, 
[0029] 

Next, the cDNA~syn thesis reaction solution was 
used for transformation of Escherichia coli DH12S (GIBCO- 
BRL) . The transformation was carried out by the 

20 electroporation method, A portion of the transformant was 
spread on the 2xYT agar culture medium containing 100 yg/ml 
ampicillin and the medium was incubated at 37*C overnight. 
A colony formed on the agar medium was picked up at random 
and inoculated on 2 ml of the 2xYT culture medium 

25 containing 100 pg/ml ampicillin. After incubation at 37*0 
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overnight, the culture mixture was centrifuged to separate 
the cells, from which a plasmid DNA was prepared by the 
alkaline lysis method. The plasmid DNA was subjected to 
double digestion with EcoRl and NotI, followed by 0.8% 
5 agarose gel electrophoresis, to determine the size of the 
cum insert. Furthermore, using the thus-obtained plasmid 
as a template, the sequence reaction was carried out by 
using an M13 universal primer labeled with a fluorescent 
dye and a Taq polymerase (a kit of Applied Biosystems) and 
10 then the product was examined with a fluorescent DNA 
sequencer (Applied Biosystems) to determine an about 400-bp 
base sequence at the 5' -terminus of the cDNA. The sequence 
data were filed as the homo/protein cDNA bank database, 
[0030] 

15 (3) Selection of cDNAs Encoding Proteins Having 
Transmembrane Domains 

A base sequence registered in the homo/protein 
cDNA bank was converted to three frames of amino acid 
sequences and the presence or absence of an open reading 

20 frame (ORF) beginning from the initiation codon was 
examined. Then, the selection was made for the presence of 
a signal sequence that is characteristic to a secretory 
protein at the N-terminus of the portion encoded by the ORF. 
These clones were sequenced from the both 5' and 3' 

25 directions by the use of the deletion method using 
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exonuclease III to determine the whole base sequence. The 
hydrophobicity/hydrophilicity profiles were obtained for 
proteins encoded by the ORF by the Kyte-Doolittle method 
[Kyte, J- t Doolittle, R. F. f J. Mol- Biol. 157: 105-132 
5 (1982JJ to examine the presence or absence of a hydrophobic 
region. In the case in which there is a hydrophobic region 
of a putative transmembrane domain in the amino acid 
sequence of an encoded protein, this protein was judged as 
a membrane protein. 
10 [0031] 

(4) Functional Verification of Secretory Signal Sequence 
or Transmembrane Domains 

It was verified by the method described in the 
literature [Yokoyama-Kobayashi, M. et al., Gene 163: 193- 
15 19S (1995] ] that the N-terminal hydrophobic region in the 
secretory protein clone candidate obtained in the above- 
mentioned steps functions as a secretory signal sequence. 
First, the plasmid containing the target cDNA was cleaved 
at an appropriate restriction en2yme site existing at the 
20 downstream of the portion expected for encoding the 
secretory signal sequence. In the case in which this 
restriction site was a protruding terminus, the site was 
blunt-ended by the Klenow treatment or treatment with the 
mung-bean nuclease. Digestion with Hindlll was further 
25 carried out and a DNA fragment containing the SV40 promoter 
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and a cDNA encoding the secretory signal sequence at the 
downstream of the promoter was separated by agarose gel 
electrophoresis- The resulting fragment was inserted 
between Hindlll in pSSD3 [ DDB J/EMBL/ GenBank Registration No. 
5 AB007632) and a restriction enzyme site selected so as to 
match with the urokinase-coding frame, thereby constructing 
a vector expressing a fusion protein of the secretory 
signal sequence of the target cDNA and the urokinase 
protease domain. 
10 [0032] 

After Escherichia coll (host: JM109) bearing the 
fusion-protein expression vector was incubated at 37*C for 
2 hours in 2 ml of the 2xYT culture medium containing 100 /i 
g/ml of ampicillin, the helper phage M13K07 (50 u 1) was 
15 added and the incubation was continued at 37^ overnight, A 
supernatant separated by centrif ugation underwent 
precipitation with polyethylene glycol to obtain single- 
stranded phage particles. These particles were suspended in 
100 |4l of 1 mM Tris-0,1 mM EDTA, pH 8 (TE) - Also, there 
20 were used as controls suspensions of single-stranded phage 
particles prepared in the same manner from pSSD3 and from 
the vector* pKAl-UPA containing a full-length cDNA of 
urokinase [Yokoyama-Kobayashi, M- et al-/ Gene 163: 193-196 
(1995)] - 
25 [0033] 


Received from < 6177424214 > at 4126/02 11:41:44 AM [Eastern Daylight Time] 


Apr-26-02 10:28 From-LAHIVE & COCKFtELO, LLP 6177424214 T-604 P. 26/64 F- 

21 

The culture cells originating from the simian 
kidney, COS7, were incubated at 37 t in the presence of 5% 
C0 2 in the Dulbecco's modified Eagle's culture medium 
(DMEM) containing 10% fetal calf serum. Into a 6-well plate 
5 (Nunc Inc., 3 cm in the well diameter) were inoculated 1 X 
10 5 COS7 cells and incubation was carried out at 37 XI for 
22 hours in the presence of 5% CO,. After the culture 
medium was removed, the cell surface was washed with a 
phosphate buffer solution and then washed again with DMEM 
10 containing 50 mM Tris-hydrochloric acid (pH 7.5) (TDMEM) . 

To the resulting cells was added a suspension of 1 \il of 
the single-stranded phage suspension, 0,6 ml of the DMEM 
culture medium, and 3 ^1 of TRANS FECTAM w (IBF Inc.) and 
the resulting mixture was incubated at 37*C for 3 hours in 

15 the presence of 5% C0 2 . After the sample solution was 
removed, the cell surface was washed with TDMEM, 2 ml per 
well of DMEM containing 10% fetal calf serum was added, and 
the incubation was carried out at 37 *C for 2 days in the 
presence of 5% C0 2 . 

20 [0034] 

To 10 ml of a 50 aM phosphate buffer solution (pH 
7.4) containing 2% bovine fibrinogen (Miles Inc.), 0.5% 
agarose, and 1 mM calcium chloride were added 10 units of 
human thrombin (Mochida Pharmaceutical Co., Ltd.) and the 

25 resulting mixture was solidified in a plate of 9 cm in 
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diameter to prepare a fibrin plate. Ten microliters of the 
culture supernatant of the tansfected COS7 cells were 
spotted on the fibrin plate, which was incubated at 37 *C 
for 15 hours. In the case in which a clear circle appears 
5 on the fibrin plate, it is judged that the cDNA fragment 
codes for the amino acid sequence functioning as a 
secretory signal sequence. On the other hand, in case in 
which a clear circle is not formed, the cells were washed 
well, then the fibrin sheet was placed on the cells, and 

10 incubation was carried out at 37*C for 15 hours. In case in 
which a clear portion is formed on the fibrin sheet, it 
indicates that the urokinase activity was expressed on the 
cell surface. In other words, the cDNA fragment is judged 
to code for the transmembrane domains- 

15 [0035] 

(5) Protein Synthesis by In Vitro Translation 

The plasmid vector bearing the cDNA of the 
present invention was used for in vitro 
transcription/translation with a T„T rabbit reticulocyte 

20 lysate Kit (Promega) . In this case, [ 35 S] methionine was 
added to label the expression product with a radioisotope. 
Each of the reactions was carried out according to the 
protocols attached to the kit. Two micrograms of the 
plasmid was reacted at 30t for 90 minutes in a total 25 (il 

25 volume of the reaction solution containing 12.5 fil of T M T 
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rabbit reticulocyte lysate, 0,5 ^1 of a buffer solution 
(attached to kit), 2 jil of an amino acid mixture 
(methionine-f ree) , 2 jU of [ 35 S] methionine (Amersham)' (0.37 
M8q/|*l}, 015 (al of T7 RNA polymerase, and 20 U of RNasin. 
5 To 3 ^1 of the resulting reaction solution was added 2 \il 
of the SDS sampling buffer (125 mH Tris-hydrochloric acid 
buffer, pH 6.8, 120 mM 2-mercaptoethanol, 2% SDS solution, 
0,025% bromophenol blue, and 20% glycerol) and the 
resulting mixture was heated at 95*C for 3 minutes and then 
10 subjected to SDS-polyacrylamide gel electrophoresis. The 
molecular weight of the translation product was determined 
by carrying out the autoradiography 
[0036] 

(6) Expression by C0S7 

15 Escherichia coli bearing the expression vector of 

the protein of the present invention was infected with 
helper phage M13K07 and single-stranded phage particles 
were obtained by the above-mentioned procedure. The thus- 
obtained phage was used for introducing each expression 

20 vector in the culture cells originating from the simian 
kidney, COS7 by the above-mentioned procedure. After 
incubation at 37 *C for 2 days in the presence of 5% CO,, 
the incubation was continued for one hour in the culture 
medium containing [ 35 S]cysteine or [ i5 S] methionine. 

25 Collection and lysis of the cells, followed by subjecting 
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to SDS-PAGE, allowed to observe the presence of a band 
corresponding to the expression product of each protein, 
which did not exist in the COS7 cells, 
[0037] 

5 (7) Clone Examples 

<HP01207> (Sequence Nos. 1, 4, and 7) 

Determination of the whole base sequence of the 
cDNA insert of clone HP01207 obtained from cDNA libraries 
of human stomach cancer revealed the structure consisting 
10 of a 100-bp 5' -nontranslation region, an 8l0-bp ORF, and a 
2028-bp 3' -nontranslation region. The ORF codes for a 
protein consisting of 2 69 amino acid residues and there 
existed seven putative transmembrane domains. Figure 1 
depicts the hydrophobicity/hydrophilicity profile, obtained 
15 by the Kyte-Doolittle method, of the present protein- In 
vitro translation resulted in formation of a smear 
translation product of a high molecular weight. 
[0038] 

The search of the protein data base by using the 
20 amino acid sequence of the present protein revealed that 
the protein was analogous to the mouse Surf ^4 protein (PIR 
Accession No, A34727) # Table 2 shows the comparison of the 
amino acid sequence between the human protein of the 
present invention (HP) and the mouse Surf-4 protein (MM) . 
2 5 Therein, the marks of * and . represent an ammo acid. 
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residue identical with the protein of the present invention 
and an amino acid residue analogous to the protein of the 
present invention, respectively. The both proteins 
possessed a homology of 99.3% in the entire region. 
5 [0039] 
[Table 2] 
Table 2 

HS MGQJTOIJ^'AEOFADQFLRVTKQYUF'nVARLCLISTFl^IXiIIWWFQWSEQRDYIDTTWN 

MM MGQNDLMGTAEDFaDQFLRVTKQYLPHVaRLCLI STFLEDG I RMWFQWSEQRPY IDTTWS 
HS CGYLUSSFVFLNLLGQLTGCVLVLSRNFVQYACFGLFGI IALQTIAYSILWDLKFLMRN 

MM CGYLUSSFVFLNLLGQLTGCVLVLSRNFVQYACFGLFGI lALQTlAYSILWDLKFLMRN 
15 HS ULG{JGLLLLUESRSEGKSMFAGVPTMI^SSPKQYMQLGGR\^LVLMFMTLLHFDASFF 

MM LALCKMLLLLUESRSEGKSMFAGVPTMRESSPKQYWQLGGRVLLVli^TLLHFDASFF 
HS SlVQNIVGTALMILVAlGFKTKimTLVmFAimnfFNAFWTlPVYKPMnDFLKYOFF 

20 MM S 1 1 QNI VGTALMI LVAlGFKTKlAALTLVVWLFAiNVYFNAFWTIPVYKPMtiDFLKYDFF 
HS QTMSV IGGLLLWALGPGGVSMDEKKKEW 
***************************** 
MM QTMSVICGLLLWALGPGGVSMDEKKKEW 

25 

[0040] 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of a base sequence that exhibited an analogy 
30 of 98.6% with a 7 62-bp part from position 122 up to 
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position 883 (GenBank Accession No. Y1482Q) , which codes 
for the fragment of the present protein, 
[0041] 

The mouse Surf-4 protein is one of proteins which 
5 are encoded in the mouse surfeit locus and has been 
considered to a housekeeping protein that is essential to 
the survival of cells [Huxley, C. et al., Mol- Cell* Biol. 
10: 605-614 (1990) ] , 
[0042] 

10 <HP01862> (Sequence Nos. 2, 5 and 8) 

Determination of the whole base sequence of the 
cDNA insert of clone HP01862 obtained from cDNA libraries 
of human stomach cancer revealed the structure consisting 
of an 80-bp 5' -nontranslation region, a 936~bp ORF, and a 

15 1274-bp 3' -nontranslation region. The ORF codes for a 
protein consisting of 311 amino acid residues and there 
existed seven transmembrane domains. Figure 2 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein . In vitro 

20 translation resulted in formation of a smear translation 
product of a high molecular weight. 
[0043] 

The search of the protein data base using the 
amino acid sequence of the present protein has revealed the 
25 presence of sequences that were analogous to the rat NMDA 
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receptor glutamate-binding subunxt (GenBank Accession No. 
S19586) . Table 3 shows the comparison of the amino acid 
sequence between the human protein of the present invention 
(HP) and the rat NMDA receptor glutamate-binding subunit 
5 (RN) . Therein, the marks of -, *, and . represent a gap, an 
amino acid residue identical with the protein of the 
present invention, and an amino acid residue analogous to 
the protein of the present invention, respectively. The 
both proteins possessed a homology of 41.0%. 
10 [0044] 
[Table 3] 
Table 3 

H S MSNPSAPPPYEDRNP 

15 

RN MKRVSWSLGTAl LPQTUIWGHKPLCLPMFSLPTLGPHTHRPLSSPLPMVNQGtPMVPV 
HS LVPGPLPPGGyGQPSVLPGGYPAYPGYPQPGYGHPAGYPQPMPPTHPMPl^GPGHGYDG 

RN PITRWLPLKDLLKEATHQGHYPQSP-FPPNPyGQPPPfQOPGSPQHGNYQEEGPPSyYDN 
20 HS EERAVSDSFGPGEWODRKVRHTFIRKVYSXISVQLLITVAIIAIFT^EPVSAFVRRNVA 

***** ** . * ******. *. . *** ** 

w Q D FPSVNW-DKSIRQAFIRKVFLVLTLQLSVTLSTVAIFTFVGEVKGFVRANV* 

HS VYYVSYAVFVVTYLIUCCQGPRRRFPWNIILLTLFTFAMGFMTGTISSMYQTKAVUAM 

w ******. * . . ..*.**. *. . ***. - *. . . * *.**.*.*.*. *. 

25 RN TYYVSYAIFFISLIVLSCCGDFRKKHPWNLVALSILTISLSYMVGMIASFYNTEAVIMAV 
HS IITAWSISVTIFCFQTKVDFTSCTGLFCVLGIVLLVTGIVTSIVLYFQYVYWLHMLYAA 

**..*...*. **. . **. ***** *. , * .**...*. *...**. 

RN GITTAVCFTVVIFSMQTRYDFTSCMGVLLVSVWLFIFAIL — CIFIRNRI-LEIVYAS 
HS LGaICFTLFLaYDTQLVLGNRKHTISPEDYITCaLQIYTDIIYIFTFVLQLMGORN 
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RN LGALLFTCFLAVDTQLLLGNKQLSLSPEEYVFAALNLYTDI INIFLYILTI IGRSQGIGQ 


5 [0045] 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
presence of sequences that possessed a homology of 90% or 
more (for example. Accession No, H06014) in EST, but any of 
10 the sequences was shorter than the present cDNAs and was 
not found to contain the initiation codon- 
[0046] 

The rat NMDA receptor glutamate-binding subunit 
is one of subunits of an NMDA receptor complex which exist 

15 specifically in the brain [Kumar, K- N. et al,, Nature 354: 
70-73 (1991) ]• The protein of the present invention has 
seven transmembrane domains characteristic to channels and 
transporters and thereby is considered to play a role as a 
channel and a transporter . 

20 [0047] 

<HP10493> (Sequence Nos. 3, 6 and 9) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10493 obtained from cDNA libraries 
of the human lymphoma U937 revealed the structure 

25 consisting of a 123-bp 5' -nontranslation region, a 1152-bp 
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ORF, and a 2430-bp 3' -nontranslation region. The ORF codes 
for a protein consisting of 383 amino acid residues and 
there existed one transmembrane domain at the N-terminus. 
Figure 3 depicts the hydrophobicity/hydrophilicity profile, 
5 obtained by the Kyte-Doolittle method, of the present 
protein. Introduction of an expression vector, wherein the 
Hindll I-AccI fragment containing a cDNA portion coding for 
the N-terrainal 44 ammo acid residues of the present 
protein was inserted into the Hindlll-PmaCI site of pSSD3, 

10 into the COS7 ceils revealed the urokinase activity on the 
cell surface to indicate that the present protein is the 
type-II membrane protein. In vitro translation resulted in 
formation of a translation product of 43 kDa that was 
almost consistent with the molecular weight of 43,001 

15 predicted from the ORF. 

[0048] 

The search of the protein data base using the 
amino acid sequence of the present protein has not revealed 
the presence of any known protein having an analogy. The 

20 search of the motif sequences has revealed a high 
probability that histidine at position 175 is an active 
site of the trypsm-type serine protease. Accordingly, the 
present protein is likely to be a membrane-type protease - 
Also, the search of the GenBank using the base sequences of 

25 the present cDNA has revealed the presence of sequences 
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chat possessed a homology of 90% or more (for example, 
Accession No- R81003) in EST, but many sequences were not 
distinct and the same OKF as that in the present cDNA was 
not found, 
5 [0049] 

Effects of the Invention: 

The present invention provides human proteins 
having transmembrane domains/ cDNAs coding for these 
proteins, and expression vectors of said cDNAs as well as 

10 eucaryotic cells expressing said cDMAs . All of the proteins 
of the present invention exist in the cell membrane, so 
that they are considered to be proteins controlling the 
proliferation and the differentiation of the cells . 
Accordingly/ the proteins of the present invention can be 

15 employed as pharmaceuticals such as carcinostatic agents 
relating to the control of the proliferation and the 
differentiation of the cells or as antigens for preparing 
antibodies against said proteins. The cDNAs of the present 
invention can be utilized as probes for the gene diagnosis 

20 and gene sources for the gene therapy- Furthermore, the 
cDNAs can be utilized for large-scale expression of said 
proteins. Cells, wherein these membrane protein genes are 
introduced and membrane proteins are expressed in large 
amounts, can be utilized for detection of the corresponding 

25 ligands, screening of novel low-molecular pharmaceuticals, 
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and so on, 

[0050] 
Sequence Listing: 
SEQ ID NO: 1 
5 LENGTH; 269 

TYPE: Amino acid 
TOPOLOGY: Linear 
MOLECULE TYPE: Protein 

HYPOTHETICAL: No 

10 ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 

CELL TYPE: Stomach cancer 

CLONE: HP01207 

SEQUENCE DESCRIPTION: 

15 Mel Gly Gin Asn Asp Leu Wet Gly Thr Ala Glu Asp Phe Ala Asp Gin 
15 10 15 

Phe Uu Are Val Thr Lys Gin Tyr Leu Pro His \tel Ala Arg Leu Cys 

20 25 30 

Leu lie Ser Thr Phe Leu Glu Asp Gly He Arg Met Trp Phe Gin Trp 
20 35 40 45 

. Ser Glu Gin Arg Asp Tyr He Asp Thr Thr Trp Asn Cys Gly Tyr Leu 
50 55 60 

Leu Ala Ser Ser Phe Val' Phe Leu Asn Leu Leu Gly Gin Leu Thr Gly 
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65 70 75 80 

Cys Val Leu Val Leu Ser Arg Asa Phe Val Gin Tyr Ala Cys Phe Gly 

85 90 95 

Leu Phe Gly lie lie Ala Leu Gin Thr He Ala Tyr Ser lie Leu Trp 
5 100 105 110 

Asp Leu Lys Phe Leu Mel Are Asn Leu Ala Leu Gly Gly Gly Leu Leu 

U5 120 125 

Leu Leu Leu Ala Glu Ser Arg Ser Glu Gly Lys Ser Met Phe Ala Gly 
130 135 140 

10 Val Pro Thr Met Arg Glu Ser Ser Pro Lys Gin Tyr He* Gin Leu Gly 
H5 150 155 160 

Gly Arg Val Leu Leu Val Leu Met Phe Met Thr Leu Leu Mis Phe Asp 

165 170 175 

Ala Ser Phe Phe Ser He Val Gin Asn He Val Gly Thr Ala Leu Met 
15 180 185 190 

He Leu Val Ala He Gly Phe Lys Thr Lys Leu Ala Ala Le u Thr Leu 

195 200 205 

Val Val Trp Leu Phe Ala lie Asn Val Tyr Phe Asn Ala Phe Trp Thr 
210 215 220 

20 He Pro Val Tyr Lys Pro Met His Asp Phe Leu Lys Tyr Asp Phe Phe 
225 230 235 240 

Gin Thr Met Ser Val He Gly Gly Leu Leu Leu Val Val Ala Leu Gly 

245 250 255 

Pro Gly Gly Val Ser Mei Asp Glu Lys Lys Lys Glu Trp 
25 260 265 
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[0051] 
SEQ ID NO; 2 
LENGTH: 311 
TYPE: Amino acid 
5 TOPOLOGY: Linear 

MOLECULE TYPE; Protein 

HYPOTHETICAL: No 

ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
10 CELL TYPE: Stomach cancer 

CLONE: HP01862 
SEQUENCE DESCRIPTION; 

Met Ser Asn Pro Ser Ala Pro Pro Pro Tyr Glu Asp Arg Asn Pro Leu 
15 10 15 

15 Tyr Pro Gly Pro Leu Pro Pro Gly Gly Tyr Gly Gin Pro Ser Val Leu 
20 25 30 

Pro Gly Gly Tyr Pro Ala Tyr Pro Gly Tyr Pro Gin Pro Gly Tyr Gly 

35 40 45 

His Pro Ala Gly Tyr Pro Gin Pro Wet Pro Pro Tbr His Pro Met Pro 
20 50 55 60 

Mex Asn Tyr Gly Pro Gly His Gly Tyr Asp Gly Glu Glu Arg Ala Val 
65 70 75 80 

Ser Asp Ser Phe Gly Pro Gly Glu Trp Asp Asp Arg Lys Val Arg His 
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10 


15 


20 


25 


85 90 95 

Thr Phe He Arg Lys Val Tyr Ser lie lie Ser Vai Gin Leu Leu He 

100 105 no 

Thr Val Ala lie He Ala He Phe Thr Phe Val Glu Pro Val Ser Ala 

U5 120 126 

Phe Val Arg Arg Asn Val Ala Val Tyr Tyr Val Ser Tyr Ala Val Phe 

130 135 HO 

Val Val Thr Tyr Leu IU Leu Ala Cys Cys Gin Gly Pro Arg Are Arg 
H5 150 155 160 

Phe Pro Trp Asn lie lie Leu Leu Thr Leu Phe Thr Phe Ala Met Gly 

f 

165 170 175 

Phe Met Thr Gly Thr lie Ser Ser Met Tyr Gin Thr Lys Ala Val He 

180 185 190 

He Ala Wet He lie Thr Ala Val Val Ser He Ser Val Thr lie Phe 

195 200 205 

Cys Phe Gin Thr Lys Val Asp Phe Thr Ser Cys Thr Gly Leu Phe Cys 

210 215 220 

Vai Leu Gly lie Val Leu Leu Val Thr Gly lie Vai Thr Ser lie Val 
225 230 235 240 

Leu Tyr Phe Gin Tyr Val Tyr Trp Leu His Met Leu Tyr Ala Ala Le w 

245 250 255 

Gly Ala He Cys Phe Thr Leu Phe Leu Ala Tyr Asp Thr Gin Leu Val 

260 265 270 

Leu Giy Asn Arg Lys His Thr He Ser Pro Glu Asp Tyr He Thr Gly 
275 280 285 
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Ala Leu Gin lie Tyr Thr Asp He lie Tyr He Phe Thr Phe fid Leu 

290 295 300 

Gin Leu Wei Gly Asp Arg Asa 
305 310 
5 [0052] 
SEQ ID NO: 3 
LENGTH: 38 3 
TYPE: Amino acid 
TOPOLOGY: Linear 
10 MOLECULE TYPE; Protein 

HYPOTHETICAL: No 

ORIGINAL SOURCE: 

ORGANISM; Homo sapiens 

CELL TYPE: Lymphoma 

15 CELL LINE: U937 

CLONE: HP10493 

SEQUENCE DESCRIPTION: 

Met Ala Gly lie Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys 
15 10 15 

20 Ala Val Gly Gin Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Tfcr Trp 
20 25 30 

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gin Ser Thr Leu Asn Lei* 
35 40 45 
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Ala Lys Pro Asp Phe Giy Ala Glu Ala Lys Leu Glu Val Ser Ser Ser 

50 55 60 

Cys Gly Pro Gin Cys His Lys Giy Thr Pro Leu Pro Thr Tyr Glu Glu 
65 70 ?5 SO 

5 Ala Lys Girt Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg 

85 90 95 

Thr Glu Thr Gin Val Ciy He Tyr He Leu Ser Ser Ser Gly Asp Gly 

100 105 110 

Ala Gin His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Are Lys Arg 
10 115 120 125 

Gin lie Tyr Gly Tyr Asp Ser Arg Phe Ser He Phe Gly Lys Asp Phe 

130 135 140 

Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys 
145 150 155 160 

15 Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys 

165 170 175 

He His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gin Lys Leu Arg Val 

180 185 190 

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp 
20 195 200 205 

Ser Thr Ser Ala Met Pro Glu Cln Met Lys Phe Gin Trp He Arg Val 

210 215 220 

Lys Arg Thr His Val Pro Lys Gly Trp He Lys Gly Asa Ala Asn Asp 
225 230 235 2*0 

25 ' lie Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His 
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245 250 255 

Lys Are Lys Phe Met Lys He Gly Val Ser Pro Pro Ala Lys Gin Leu 

260 265 270 

Pro Gly Gly Arg lie His Phe Ser Gly Tyr Asp Asa Asp Are Pro Gly 
5 275 280 285 

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu 

290 295 300 

Lea Tyr Gin Gin Cys ASP Ala GJ n Pro Gly Ala Ser Gly Ser Gly Val 
305 310 315 320 

10 Tyr Val Are Met Trp Lys Arg Gin Gin Gin Lys Trp Glu Arg Lys lie 

325 330 335 

lie Gly lie Phe Ser Gly HU Gin Trp Val Asp Met Asn Ciy Ser Pro 

340 345 350 

Gin Asp Phe Asn Val Al& Val Arg lie Thr Pro Leu lys Tyr Ala Gin 
15 355 360 365 

lie Cys Tyr Trp lie Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly 
370 375 380 

[0053] 
SEQ ID NO: 4 
20 LENGTH: 807 

TYPE: Nucleic acid 

STRANDEDNESS : Double 

TOPOLOGY: Linear 

MOLECULE TYPE: cDNA to roRNA 
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ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
CELL TYPE: Stomach cancer 
CLONE: HP01207 
5 SEQUENCE DESCRIPTION: 


ATGGCCCAGA 

acgacctgat 

GGGCACGGCC GAGGACTTCG CCGACCAGTT CCTCCGTGTC 

60 

aCAAAGCAGT 

ACCTOCCCCA 

CGTGGCGCGC CTCTGTCTGA TCAGCACCTT CCTGGAGGAC 

120 



(vrMiAfim a (i CAcmcdACT acatcCiaCaC cxcctg&aac 

ISO 
low 

TGCCGCTACC 

TGCTGGCCTC 

GTCCTTCGTC TTCCTCAACT TGCTGGGaCa GCTGACTGGC 

240 

TGCGTCCTGG 

TGTTGAGCAG 

GAACTTCGTG CAGTACGCCT GCTTCGGGCT CTTTGGAATC 

300 

ATAGCTCfGC 

AGACGATTGC 

CTACAGCATT TTATGGGACT TGAAGTTTTT GATGACGAAC 

360 

CTGGCCCTGG 

GAGCaCGCCT 

GTTGCTGCTC CTAGCAGAAT CCCCTTCTGA AGGGAAGAGC 

420 

ATGTTTGCGG 

GCGTCCCCAC 

CATGCGTGAG AGCTCCCCCA AACAGTACAT GCAGCTCGGA 

480 

GGCAGGGTCT 

TGCTGGTTCT 

GATGTTCATG ACCCTCCTTC AOTTGACGC CAGCTTCTfT 

540 

TCTATTGTCC 

AGAaCaTCGT 

GGGCACAGCT CTGATGATTT TAGTGGCCAT TGGTTTTAAA 

600 

ACCAAGCTGG 

ctgctttgac 

TCTTGTTGTG TGGCTCTTTG CCATCAACGT aTaTTTCaAC 

660 

GCCTTCTGGA 

CCATTCCAGT 

CTACAAGCCC aTGCaTGaCT TCCTGAAATA CGACTTCTTC 

720 

CAGACCATGT 

CGGTGATTGG 

GGGCTTGCTC CTGGTGGTGG CCCTGGGCCC TGGGGGTGTC 

780 

TCCATGGATG 

AGAAGAAGAA 

GGaGTGG 

807 


[0054] 




SEQ ID NO: 5 


LENGTH: 933 

TYPE: Nucleic acid 
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STRANDEDNESS: Double 
TOPOLOGY: Linear 
MOLECULE TYPE: cDNA to mRNA 
ORIGINAL SOURCE: 
5 ORGANISM; Homo sapiens 

CELL TYPE: Stomach cancer 

CLONE; HP01862 


SEQUENCE DESCRIPTION: 


ATGTCCAACC 

CCAGCQCCCC 

ACCACCATAT 

GAAGACCGCA 

ACCCCCTGTA 

CCCACGCCCT 

60 

CTGCCCCCTG 

CGGGCTATGG 

GCAGCCATCT 

GTCCTGCCAG 

GAGGGTATCC 

tgcctaccct 

120 

GGCTACCCGC 

AGCCTGGCTA 

CGGTCACCCT 

GCTGGCTACC 

CACACCCCAT 

GCCCCCCACC 

180 

CACCCGATGC 

CCATGAACTA 

CGGCCCAGGC 

CATGGCTATG 

ATGGGGAGGA 

cagagcggtg 

240 

AGTGATAGCT 

TCGGGCCTGG 

AGAGTGGGAT 

GACCGGAAAG 

tgcgacaCac 

TTTTATCCGA 

300 

AAGGTTTACT 

CCaTCATCTC 

CGTGCAGCTG 

aCATCACTG 

TGGCCATCAT 

TGCTATCTTC 

360 

ACCTTTGTGG 

AACCTGTCAG 

CGCCTTTGTG 

AGGAGAAATG 

TGGCTGTCTA 

CTACGTGTCC 

420 

TATGCTGTCT 

TCGTTGTCAC 

CTACCTGATC 

CTTGCCTGCT 

GCCAGGGACC 

CAGACGCCGT 

480 

TTCCCATGGA 

ACATCATTCT 

GCTGACCCrf 

TrTACTTTTG 

CCATGGGCTT 

CATGACGGGC 

540 

ACCATTTCCA 

GTATGTACCA 

AACCAAAGCC 

gtcatcattg 

CAATGATCAT 

CACTGCGGTG 

600 

GTATCCATTT 

CAGTCACCAT 

CTTCTGCfTT 

cagaccaagg 

TGGACTTCAC 

CTCGTGCACA 

660 

GGCCTCTTCT 

GTGTCCTGGG 

AATTGTGCfC 

ctggtgactc 

CGATTGTCAC 

TaGCaTTCTG 

720 

CTCTACTTCC 

AATACGITTA 

CTGGCTCCAC 

ATGCTCTATG 

CTGCfCfGGG 

GGCCaTTTGT 

780 

TTCACCCTCT 

TCCTCGCTTa 

CGACACACAG 

CTGGTCCTGG 

GGAACCGGAA 

CCACaCCaTC 

840 

AGCCCCGAGG 

ACTACATCAC 

TGCCGCCCTG 

CAGATTTACA 

CAGACATCAT 

CTACATCTTC 

900 
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ACCTTTGTGC TGCAGCTCaT GGGGGATCGC AAT 933 

[0055] 
S£Q ID NO: 6 
LENGTH: 1149 
5 TYPE: Nucleic acid 

STRANDEDNESS : Double 

TOPOLOGY: Linear 

MOLECULE TYPE: cDNA to mRNA 

ORIGINAL SOURCE: 

10 ORGANISM: Homo sapiens 

CELL TYPE: Lymphoma 

CELL LINE: U937 

CLONE: HP10493 
SEQUENCE DESCRIPTION; 

15 ATGGCAGGGA ttccagggct cctcttcctt ctcttctttc TGCTCTGTGC TGTTGCGCAA 60 

GTGAGCCCTT ACAGTGCCCC CTGGAAACCC ACTTGGGCTG CATACCGCCT CCCTGTCGTC 120 

TTGCCCCaGT CTACCCTCaA TTTAGCCAAG CCAGACTTTG GAGCCGAAGC CAAATTAGAA 180 

GTATCTTCTT CATGTGGACC CCAGTGTCAT AAGGGAACTC CaCTGCCCAC TTaCGAAGAG 240 

GCCAAGCAAT ATCTGTCrfA TGAAACGCTC TATGCCAATG GCAGCCGCaC AGAGACGCaG 300 

20 GTGGGCATCT ACATCCTCAG CAGTAGTGGA GATGGGGCCC AACACCGAGA CTCAGGGTCT 360 

TCAGGAAAGT CTCGAAGGAA GCGGCACATT TATGGCTATG ACaGCAGGTT CaGCaTTTTT 420 

CGGAAGGACT TCCTGCTCaa CTaCCCTTTC TCAACATCAG TGAAGTTATC CACGGGCTGC 480 
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ACCGGCACCC TGGTOCCAGA GAaGCATGTC CTCACaGCTG CCCACTGCAT ACACGATGGA 5*0 

AAMCCTATC TGAAAGGAAC CCaGAAGCTT CGAGTGGGCT TCCTAAAGCC CAAGTTTAAA 600 

GATGGTGGTC GAGGGGCCAA CGACTCCACT TCAGCCATGC CCCAGCAGaT GAAATTTCAG 660 

TGGATCCGGG TGAAACGCAC CCaTCTGCCC AAGGGTTGGA TCaaGGGCAA TGCCAATGAC 720 

5 ATCGGCATGG ATTATGATTA TGCCCTCCTG GAACTCA/VAA AGCCCCaCaA GAGAAAATTT 780 

ATGAAGATTG GGGTGAGCCC TCCTGCTAAG CAGCTGCCAG GGGGCAGAAT TCACTTCTCT 840 

GGTTATGACA ATGACCCACC AGGCAATTTG GTGTATCGCT TCTGTGACGT CAAAGACGAG 900 

ACCTATGACT TGCTCTACCA GCAATGCGAT GCCCAGCCAG GGGCCAGCGG GTCTGGGGTC 960 

TATGTGAGGA TGTGGAAGAC ACAGCAGCAG aagtgggagc GAAAAATTAT TGGCATTTTT 1020 

10 TCAGGGCACC AGTGGGTGGA CATGAATGGT TCCCCACAGG ATTTCAACGT GGCTGTCAGA 1080 

ATCACTCCTC TCAAATATGC CCAGATTTGC TATTGGATTA AAGCAAACTA CCTGGATTGT 1140 

AGCGAGGGG 1H9 
SEQ ID NO: 7 
LENGTH: 2938 

15 TYPE; Nucleic acid 

STRANDEDNESS : Double 
TOPOLOGY: Linear 
MOLECULE TYPE; CDNA to iuRNA 
ORIGINAL SOURCE: 
20 ORGANISM: Homo sapiens 

CELL TYPE: Stomach cancer 

CLONE: HP01207 
FEATURES; 
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NAME/KEY: CDS 
LOCATION: 1 01.. 910 
IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 
5 AAAAAGGGCA CTTCCTGTGG AGGCCGCACC GCGTGCGGGC GCCGaCGGGC GaGAGCCAGC 60 

GAGCGAGCGA gcgagccgag ccgagcctcc cgccgtcgcc atg ggc cag aac GAC 115 

Met Gly Gin Asn Asp 
1 5 

CTG ATG GGC aCG GCC GAG GAC TTC GCC GaC CAG TTC CTC CGT GTC ACA 163 
10 Leu Mel Gly Thr Ala Glu Asp Phe Ala Asp Gin Phe Leu Arg Val Thr 

10 15 . 20 

AAG CAG TAC CTG CCC CAC GTG GCG CGC CTC TGT CrG ATC AGC ACC TTC 211 
Lys Gin Tyr Leu Pro His Val Ala Arg Leu Cys Leu lie Ser Thr Phe 
25 30 35 

15 CTG GAG GAC GGC ATC CGT ATG TGG TTC CAG TGG AGC GAG CAG CGC GAC 259 
Leu Glu Asp Gly lie Arg Met Trp Phe Gin Trp Ser Clu Gin Arg Asp 

40 45 50 

TAC ATC CAC ACC ACC TGG AAC TGC GGC TAC CTG CTG GCC TCG TCC TTC 307 
Tyr He Asp Thr Thr Trp Asn Cys Gly Tyr Leu Leu Ala Ser Ser Phe 
20 55 60 65 

GTC TTC CTC AAC TTG CTG GGA CaG CTG ACT GGC TGC GTC CTG GTG TTG 355 
Val Phe Leu Asn Leu Leu Gly Gin Leu Thr Gly Cys Val Leu Val Leu 
70 75 80 85 

AGC AGG AAC TTC GTG CaG TAC GCC TGC TTC GCG CTC TTT GGa ATC ATA 403 


Received from < 6177424214 > at 4/26/02 11:41 :44 AM [Eastern Daylight Time] 


Apr-26-02 10:35 


From-LAHIVE I COCKF I ELD, LLP 


6177424214 


T-604 P. 48/64 F-964 


43 


15 


20 


Ser Arg Aso Phe Val Gin Tyr Ala Cys Phe Cly Leu Phe Gly lie lie 

90 95 100 

GCT CTG CAG ACG ATT GCC TAC AGC ATT TTA TGG GaC TTG AAG TTT TTG 451 
Ala Leu Gin Thr He Ala Tyr Ser He Leu Trp Asp Leu Lys Phe Leu 
5 105 UO 115 

ATG AGG AAC CTG GCC aG GGA GGA GGC CTG TTG CTG CTC CTA GCA CM 499 
Met Arg Asn Leu Ala Leu Gly Gly Gly Leu Leu Leu Leu Leu Ala Glu 

120 125 130 

TCC CGT TCT GAA GGG AAG AGC ATG TTT GCG GGC GTC CCC ACC ATG CGT 547 
10 Ser Arg Ser Glu Gly Lys Ser Mei Phe Ala Gly Val Pro Thr Met Arg 
135 140 145 

GAG AGC TCC CCC AAA CaG TAC ATG CAG CTC GGA GGC AGG GTC TTG CTG 595 
Glu Ser Ser Pro Lys Gin Tyr Mei Gin Leu Gly Gly Arg Val Leu Leu 
150 155 160 165 

GTT CTG ATG TTC ATG ACC CTC CTT CaC TTT GAC GCC AGC TTC TTT TCT 643 
Val Leu Mei Phe Met Thr Leu Leu His Phe Asp Ala Ser Phe Phe Ser 

170 175 180 

ATT GTC CAG AAC ATC GTG GGC ACA GCT CTG ATG ATT TTA G'fG GCC ATT 691 
He Val Gin Asn He Val Gly Thr Ala Leu Mei He Leu Val Ala He 

185 190 195 

GGT TTT AAA ACC AAG CTG GCT GCT TTG ACT CTT GTT GTG TGG CTC TTT 739 
Gly Phe Lys Thr Lys Leu Ala Ala Leu Thr Leu Val Val Trp Leu Phe 

200 205 210 

GCC ATC AAC GTA TAT TTC AAC GCC TTC TGG ACC ATI' CCA GTC TAC AAG 787 
2 5 Ala He Asn Val Tyr Phe Asn Ala Phe Trp Thr lie Pro Val Tyr Lys 
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215 220 225 

CCC ATG CAT GAC TfC CTG AAA TAG GaC TTC TTC CAG ACC ATG TCG GTG 835 
Pro Mei His Asp Phe Leu Lys Tyr Asp Phe Phe Gin Thr Met Ser Val 
230 235 2*0 245 

5 ATT GGG GGC TTC CTC CTG GTG GTG GCC CTG GGC CCT GGG GGT GTC TCC 883 
He Gly Gly Uu Leu Leu Val Val Ala Lea Gly Pro Gly Gly Val Ser 

250 255 260 

ATG GAT GAG AAG AAG AAG GAC TGG TAA CAGTCACAGA TCCCTACCTG 930 
Met Asp Glu Lys Lys Lys Glu Trp 

10 265 

CCTGGCTAAG ACCCGTGGCC GTCAAGGACT GCTTCGGGGT GGATTCAACA AAACTCCCAC 990 

CTTTTATGTA TCCTCTTCCC TTCCCCTCCC TTGGTAAAGG CACAGATGTT TTGAGAACTT 1050 

TATTTGCAGA GACACCTGAG AATCGATGGC TCAGTCTGCT CTGGAGCCAC AGTCTGGCGT 11 10 

CTGACCCTTC AGTCCAGGCC AGCCTGGCAG CTGGAAGCCT CCCCCACGCC CAGGCTTTGG 1170 

15 AGTGAACAGC CCGCTTGGCT GTGGCATCTC AGTCCTATTT TTGAGTTTTT TTGTGGGGGT 1230 

ACAGGAGGGG GCCTTCAAGC TGTACTGTGA GCAGACGCAT TGGTATTATC ATTCAAAGCA 1290 

GTCTCCCTCT TaTTTGTAAG TTTACATTTT TAGCGGAAAC TACTAAATTA TTTTGGGTGG 1350 

TTCAGCCAAA CCTCAAAACA GTTAATCTCC CTGGTTTAAA ATCaCACCAG TGGCTTTGAT 1410 

GTTGTTTCTG CCCCGCATTG TATTTTATAG GAATACTGAA AACATTTaGG GACACCCAAA 1470 

20 GAATGATGCA GTATTAAAGG GGTGGTAGAA GCTGCTGTTT ATGATAAAAG TCATCGGTCa 1530 

GAAAATCAGC TTGGATTGGT GCCAAGTGTT TTATTGGGTA ACACCCTGGG AGTTTTAGTA 1590 

GCTTGAGGCA AGGTGGAGGG GCAAGAAGTC CTTGGGCAAG CTGCTGGTCT GGGTGCTGCT 1650 

GGCCTCCAAG CrGCCAGTGC GAAGGGCTAG TGAGACCACA CaGGGGTAGC CCCAGCAGCA 1?10 

GCaCCCTGCA AGCCAGCCTC GCCaGCTGCT CAGACCAGa TGCAGAGCCG CAGCCGCTGT 1770 

25 GGGCAGGGGG TGTGGCAGGA GCTCCCAGCA CTGGaGaCCC ACGGACTCAA CCCAGTTACC 1830 
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TCACATGGGG CCTTTTCTGA GCAAGCTCTC 

ccGcccrnc ccagctgcac tcgccctgtg 

TGTCGCTCAC TCAGaTTGTC CGTTTGCTAT 
TTTCTGATGC CTTACCGATT TGATCTfAAT 
5 ATACTGTGTT TCTCTTTTTG GGGGAGCTTA 
TAGTAAATGC CACAAGGGTA GTCGAACACC 
gctggctcag CCTGTCTCCA GGGCTGCTGC 
caCTCATCC ATTGGCTCTG CAGGGCAGGG 
tggaagtcac ctaccttttt AACACAGCCG 
xo CCTGGTAGCC tacttcctta CCCCCGAATA 
tgggttctct tctcctgtga TCATTCAAGT 
tgtttcaacc tcaccagggc tgtctcttgg 

GACAGCCCCC ATCAAATGAC CTTGGCCmG 

gctgattggt GGAAACTAGG gtggaccaaa 
15 tgcaccagca gcgcctccgt cctagtgggt 
agggcctgat tcgggaagat gcctttgcag 
gattctggca aaacaatttc taagattttt 
cattttatgc tgtattttat atcttagttg 
catcaaaata aataatggcg tttgttgt 

20 [0056] 
SEQ ID NO: 8 

LENGTH: 2290 

TYPE: Nucleic acid 

STRANDEDNESS : Double 


45 


GAAAGCGCAG GCCQCCCIGG CTGAGCAGCA 

1890 

GaCAGCCCCG ACACACCACT TTCCTGAGGC 

1950 

GCCCAATGCA gccaaaattc ctttttacaa 

2010 

CCTGTATTfA AAGTTTTCTA ACACfGCCTT 

2070 

ACTGCTTGTT GCfCCCTGTC GTCTGCaCCA 

2130 

TCTCTGGCCC CTAGACCTAT CTGGGGACAG 

2190 

CGCCCAGCCC CGAGCCTGCC TCCCTCTTGG 

225Q 

GTGAGGCAGG TTTCTGCrCA TAAGTGCTTT 

2310 

AACTAGTCCC AACGCGTTTG CAAATATTCC 

2370 

TTGGTAAGAT CGATCAATGG CTTCaGGaCA 

2430 

gctcactgca tgaagactcc cttgtctcag 

2490 

TCCACACCTC GCTCCCTGTT AGTGCCGTAT 

2550 

TCACGCTTTC tctgtggtca aggttggttg 

2610 

ggaggccacg tgagcagtca gcaccagttc 

2670 

GTTCCTGTTT L i LC| IKjLLL i LUu 1 uuut I 

£ I J\J 

GGAGGGGAGG ATAAGTGGGA tctaccaatt 

2790 

TTGCTTTATG TGGGAAACAG aTCTAAATCT 

2850 

TGTTTGAAAA CGTTTTGATT TTTGGAAACA 

2910 


2938 
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TOPOLOGY: Linear 

MOLECULE TYPE: cDNA to inRNA 

ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
5 CELL TYPE: Stomach cancer 

CLONE: HP01862 
FEATURES: 

NAME/KEY; CDS 
LOCATION: 81..10I6 
10 IDENTIFICATION METHOD: E 

SEQUENCE DESCRIPTION; 

ACACTCCGAG GCCAGGAACG CTCCGTCTGG AACGGCGCAG GTCCCAGCAG CTGGGGTTCC 60 
CCCTCAGCCC GTGAGCAGCC ATG TCC AAC CCC AGC GCC CCA CCA CCA TAT GAA 113 

Met Ser Asn Pro Ser Ala Pro Pro Pro Tyr Clu 
15 1 5 10 

GAC CGC AAC CCC CTG TAC CCA GGC Ca CTG CCC CCT GGG CCC TAT GGG 161 
Asp Arg Asn Pro Lea Tyr Pro Gly Pro Leu Pro Pro Gly Gly Tyr Gly 

15 20 25 

CAG CCA TCT GTC CTG CCA GGA GGG TAT CCT GCC TAC CCT GGC TAC CCG 209 
20 Gin Pro Ser Val Leu Pro Gly Gly Tyr Pro Ala Tyr Pro Gly Tyr Pro 
30 35 40 

CAG CCT GGC TAC GGT CAC CCT GCT GGC TAC CCA CaG CCC ATG CCC CCC 257 
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Gin Pro Gly Tyr Gly His Pro Ala Gly Tyr Pro Gin Pro ttet Pro Pro 

45 50 55 

ACC CAC CCG ATG CCC ATG AAC TAC CCC CCA GGC CAT GGC TAT GAT GGG 305 
Thr His Pro Met Pro Met Asn Tyr Gly Pro Gly His Gly Tyr Asp Gly 
$ 60 65 70 75 

GAG GAG AGA GCG GTG AGT GAT AGC TTC GGG CCT GGA GAG TGG GAT GAC 353 
Glu Glu Arg Ala Val Ser Asp Ser Phe Gly Pro Gly Glu Trp Asp Asp 

80 85 90 

CGG AAA GTG CGA CAC ACT TTT ATC CGA AAG GTT TAC TCC ATC ATC TCC 401 
10 Arg Lys Val Arg His Thr Phe He Arg Lys Val Tyr Ser lie He Ser 
95 100 105 

GTG CAG CTG CTC ATC ACT GTG GCC ATC ATT GCT ATC TTC ACC TTT GTG 449 
Val Gin Leu Leu lie Thr Val Ala Tie IU Ala He Phe Thr Phe Val 
110 115 120 

15 GAA CCT GTC AGC GCC TTT GTG AGO AGA AAT GTG GCT GTC TAC TAC GTG 497 
Glu Pro Val Ser Ala Phe Val Arg Arg Asn Val Ala Val Tyr Tyr Val 

125 130 135 

TCC TAT GCT GTC TTC GTT GTC ACC TAC CTG ATC CTT GCC TGC TGC CAG 545 
Ser Tyr Ala Val Phe Vul Val Thr Tyr Leu He Leu Ala Cys Cys Gin 
20 140 H5 150 155 

GGA CCC AGA CCC CGT TTC CCA TGG AAC ATC ATT CTG CTG ACC CTT TTT 593 
Gly Pro Arg Arg Arg Phe Pro Trp Asn lie He Leu Leu Thr Leu Phe 

160 165 170 

ACT TTT GCC ATG GGC TTC ATG ACG GGC ACC ATT TCC AGT ATG TAC CAA 641 
25 Thr Phe Ala Met Gly Phe Wei Thr Gly Thr He Ser Ser Met Tyr Gin 
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175 180 185 

ACC AM GCC GTC ATC ATT CCA ATC ATC aTC ACT GCG G'fG GTA ICC ATT 689 
Thr Lys Ala Val lie lie Ala Mel lie He Thr Ala Val Val Ser lie 
190 195 200 

5 TCA GTC ACC ATC TTC TGC TTT CaG ACC AAC GTG GAC TIC ACC TCG TGC 737 
Ser Val Thr He Phe Cys Phe Gin Thr Lys Vai Asp Phe Thr Ser Cys 

205 210 215 

ACA GGC CTC TTC TGT GTC CTG GGA ATT GTG CTC CTG GTC ACT GGG ATT 785 
Thr Gly Leu Phe Cys Val Leu Gly He Val Leu Leu Val Thr Gly Ile 
XO 220 225 230 235 

GTC ACT AGC ATr GTG CTC TAC TTC CAA TAC GTT TaC TGG CTC CaC ATG 833 
Val Thr Ser Ile Val Leu Tyr Phe Gin Tyr Val Tyr Trp Leu His Met 

240 245 250 

C1C TAT GCT GCT CTG GGG GCC ATT TGT TTC ACC CTG TTC CTG GCT TAC 881 
15 Leu Tyr Ala Ala Leu Gly Ala Ile Cys Phe Thr Leu Phe Leu Ala Tyr 
255 260 265 

GAC ACA CAG CTG GTC CTG GGG AAC CGG AAG CAC ACC ATC AGC CCC GAG 929 
Asp Thr Gin Leu Val Leu Gly Asn Arg Lys His Thr He Ser Pro Glu 
270 275 280 

20 GAC TAC ATC ACT GGC GCC CTG CAG ATT TAC ACA GaC ATC ATC TAC ATC 977 
Asp Tyr lie Thr Gly Ala Leu Gin lie Tyr Thr Asp Ile He Tyr Ile 

285 290 295 

TfC ACC TTT GTG d'C CAG CTG ATG GGG GaT CGC AAT TAAGGAG 1020 
Phe Thr Phe Val Leu Gin Leu Met Gly Asp Arg Asn 
25 300 305 310 
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CAAGCCCCCA TTTTCaCCCG atcctggcct 

TGACTGTGGT CTGGGCTTTA GGCCCCTTTC 

TTCTGTCCTG GAGACAGGTG GCCTCTCTGG 

GAGGAGCTAG GGACTAACTG TTGCTCTTGG 

5 GTCTTCTCCC CGCCaCCTAC TGTATGaCaC 

GAATATGAAA agaccctaTT cgatagctag 
aggtcacgag gttcccctcc CACCTactc 
rncncccc cagcaaagcc agagagcttt 

ATCCTGTATT CCTTTGGCTT GGCATCTTTT 
10 ATGGGTCTCC TTGCTTCAAT CCCTTCTTGT 
TAGGGATGGG GGaCaGaTAA TACAACGaGC 
GCATCTCCCC TGGCCTTGCT CCTGGCTTGT 
gccatctgct CCCCATT CTT GAAAGCTGCT 
GTCAGAGTGA ACTCTTGCTT CCTGTATTCA 
15 GTCATACGTG TGGCCaGGAA GTAGCCAGGG 
TGCCTGGGGG TCCCTCACCT GGCTAGGGAG 
TCTGGGTTCT TTCCTTCTGC TAACCCAGCG 
GTTTGTCTTA AGTTGCACTG GGGGATTTCT 
TGCCTTCTTT GTAATATTM GTGCCTTGAG 
20 CTGTCGGGTG GGGGCaGAaa TCAAATCAGC 
TAGAGAAATC CTAACTATAT CACACAAAGG 
TCTAGAAGTT 

[0057] 
SEQ ID NO: 9 

25 LENGTH: 37 05 
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CTCCCTTCCA AGCTAGAGGG CTGGGCCCTa 1080 

CTTCCCCTTG AGTAACATGC CCAGTTTCCT I HO 

CTATGGATGT CTGGGTACTT GCTCGGGACG 1200 

TGGGCTTGGC AGGGACTAGG CTGAAGaTGT 1260 

CACATTCTTC CTAACAGCTG GGGTTGTGAG 1320 

AAGGGAATAT GAAAGGTAGA AGTGAOTCA 1380 

ACAGGCTTCT TGaCTaCGTA GTTGGAGCTA 1440 

G1CCCCGGCC TCCTGGACAC ATAGGCCATT 15Q0 

AGCTCAGGAA GGTAGAAGAG ATCTGTGCCC 1560 

TTCAGTGACA TATGTATTGT TTATCTGGCT 1620 

AAAGTAACCT ATACAGGCCA GCATGGAACA 1680 

GACGCTATAA GACAGAGCaG GCCACATGTG 1740 

GGGGCCTCCT TGCACGCTTC TGGATCTCTG 1800 

GGCAGCTCAG ACCAGAAAGT AAGGGGCAGA i860 

TGAAGAGAGA CTCGGTGCGG GCAGGGAGAA 1920 

ATACCGAAGC CTACTCTGGT ACTGAAGACT 1980 

AGGGTCCTAA GAGGAAGGTG acttctctct 2040 

CACTTGAGGC ccatctctcc AGCCAGCCAC 2100 

CTGGAATGGC GMGGGGGAC AAGGGTCaGT 2160 

CCAAGGATAT AGTTAGGATT AATTACTTAA 2220 

GATACAACTA TAAATGTAAT AAAATTTATG 2280 

2290 
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TYPE; Nucleic acid 
STRANDEDNESS: Double 
TOPOLOGY: Linear 
MOLECULE TYPE: CDNA to mRNA 
5 ORIGINAL SOURCE: 

ORGANISM; Homo sapiens 

CELL LINE: U937 

CLONE: HP10493 
FEATURES : 
10 NAME/KEY: CDS 

LOCATION: 124^.1275 

IDENTIFICATION METHOD; E 
SEQUENCE DESCRIPTION: 

ACTCTCGGCT GTGCGGCGGG GCAGGCATGG GACCCGCGCC CTCTCTCCCG GCGCCCACAC 60 
15 CTGTCTGAGC GGCGCAGCGA QCCGCGGCCC GGGCGGGCTG CTCGGCGCGG AACaGTGCTC 120 
GGC ATG GCA GGG ATT CCA GGG CTC CTC TTC CTT CTC TIC TTT CTG CTC 168 
Met Ala Gly II© Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu 
15 10 15 

TGT GCT GTT GGG CAA GTG AGC CCT TAC AGT GCC CCC TGG AAA CCC ACT 216 
20 Cys Ala Val Gly Gin Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr 

20 25 30 

TGG CCT CCA TAC CGC CTC CCT GTC CTC TIG CCC CAG TCT ACC CTC AAT 264 
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15 


20 


Trp Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gin Ser Thr Leu Asn 

35 40 45 

TTA GCC AAG CCA GAC TTT GGA GCC GAA GCC AAA TTA GM GTA TCT TCI' 312 
Leu Ala Lys Pro Asp Phe Gly Ala Glu Ala LyS Leu Glu Val Ser Ser 
5 50 55 60 

TCA TGT GGA CCC CAC TGT CAT AAG GGA ACT CCA CTG CCC ACT TAC GAA 360 
Ser Cys Gly Pro Gin Cys His Lys Gly Thr Pro Leu pro Thr Tyr Glu 

65 70 75 

GAG GCC AAG CAA TAT CTG TCT TAT GAA ACG CTC TAT GCC AAT GGC AGC 408 
10 Glu Ala Lys Gin Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser 
80 85 90 95 

CGC ACA GAG ACG CAG CTG GGC ATC TaC ATC CTC AGC AGT ACT GGA GAT 456 
Arg Thr Glu Thr Gin Val Gly He Tyr He Leu Ser Ser Ser Gly Asp 

100 105 110 

GGG GCC CAA CAC CGA GAC TCA GGG TCT TCA GGA AAG TCT CGA AGG AAG 504 
Gly Ala Gin His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys 

115 120 125 

CGG CAG ATT TaT GGC TAT GAC AGC AGG TTC AGC ATT TTT GGG AAG GaC 552 
Arg Gin lie Tyr Gly Tyr Asp Ser Arg Phe Ser lie Phe Gly Lys Asp 

130 135 140 

TTC CTC CTC AAC TaC CCT TTC TCA ACA TCA G'fG AAG TTA TCC ACG GGC 600 
Phe Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly 

145 150 155 

TGC ACC GGC ACC CTG GTG GCA GAG AAG CAT GTC CTC ACA GCT GCC CaC 648 
25 Cys Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His 
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10 


15 


20 


25 


160 165 170 I 75 

TOC ATA CAC GAT GGA AAA ACC TAT GTG AAA GGA ACC CAG AAG CTT CGA 696 

Cys lie His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gin Lys Leu Arg 

180 185 190 

GTG GGC TTC C'fA AAG CCC AAG TTT AAA GAT GGT GGT CGA GGG GCC AAC 744 
Val Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn 

195 200 205 

GAC TCC ACT TCA GCC ATG CCC GAG CAG ATG AAA TTT CAG TGG ATC CGG 792 
Asp Ser Thr Ser Ala Met Pro Glu Gin Met Lys Phe Gin Trp lie Arg 

210 215 220 

GTG AAA CGC ACC CAT GTG CCC AAG GGT TGG ATC AAG GGC AAT GCC AAT 840 
Val Lys Arg Thr His Val Pro Lys Gly Trp He Lys Gly Asn Ala Asn 

225 230 235 

CAC ATC GGC ATG GAT TAT GAT TAT GCC CTC CTG GAA CTC AAA AAG CCC 888 
Asp lie Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro 
240 245 250 255 

CAC AAG AGA AAA TTT ATG AAG ATT GGG GTG AGC CCT CCT GCT AAG CAG 936 
His Lys Arg Lys Phe Met Lys lie Gly Val Ser Pro Pro Ala Lys Gin 

260 265 270 

CTG CCA GGG GGC AGA ATT CAC TTC TCT GGT TAT GAC AAT GAC CGA CCA 984 
Leu Pro Gly Gly Arg He His Phe Ser Gly Tyr Asp Asn Asp Arg Pro 

275 280 285 

GGC AAT TTG GTG TAT CGC TTC TGT GAC GTC AAA GAC GAG ACC TAT GAC 1032 
Gly Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp 
290 295 300 
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TIG CTC TAC CAC CAA TGC GAT GCC CAG CCA GGG GCC AGC GGG TCT GGG 1080 
Leu Leu Tyr Gin Gin Cys Asp Ala Gin Pro Gly Ala Ser Gly Ser Gly 

305 310 315 

GTC TAT GTG AGG ATG TGG AAG AGA CAG CAG CAG AAG TGG GaG CGA AAA 1128 
5 Val Tyr Val Arg Met Trp Lys Arg Gin Gin Gin Lys Trp Glu Arg Lys 
320 325 330 335 

ATT ATT GCC ATT TTT TCA GGG CAC CAG TGG GTG GAC ATG AAT GGT TCC 1176 
II© lie Gly He Phs Ser Gly His Gin Trp Val Asp Met Asn Gly Ser 
340 345 350 

10 CCA CAG GAT TTC AAC GTG GCT GTC AGA ATC ACT CCT CTC AAA TAT GCC 1224 
Pro Gin Asp Phe Asn Val Ala Val Arg lie Thr Pro Leu Lys Tyr Ala 

355 360 365 

CAG ATT TGC TAT TGG ATT AAA GGA AAC TAC CTG GAT TGT AGG GAG GGG 1272 
Gin lie Cys Tyr Trp He Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly 
l5 370 375 380 

TGACACAG TGTTCCCTCC TGGCAGCAAT TAAGGGTCTT CATGTTCTTA TTTTAGGaGA 1330 
GGCCAAATTG TTTTTTGTCA TTCGCGTGCA CACGTGTGTG TGTGTGTGTG TGTGTAAGGT 1390 
GTCTTATAAT CTTTTACCTA TTTCTTACAA TTGCAAGATG ACTGGCTTTA CTaTTTGAAA 1450 
ACTGCTTTGT GTATCATATC ATATATCATT TAAGCAGTTT GAAGGCATAC TTTTGCATAG 1510 
20 AAATAAAAAA AATACfGATT TGGGGCAATG AGGAATATTT GACAATTAAG TTAATCTTCA 1570 
CGTTTTTGCA AACTTTGATf TTTATnCAT CTGAACTTGT TTCAAAGATT TaTATTAAAT 1630 
ATTTGGCATA CAAGAGATAT GAATTCTTAT ATGTGTGCAT GTGTGTTTTC TTCTGAGATT 1690 
CATCfTGGTG G'fGGGTTTTT TTGTTTTTTT AATTCaGTGC CTGaTCTTTA aTGCTTCCAT 1750 
AAGGCAGTGT TCCCaHTAG GaaCTTTGaC AGCATTTCIT AGGCAGAATA TTTTGGaTTT 1810 
25 GGACGCATH GCATGGTAGT CTTTGAACAG TAAAATGATG TGTTGACTAT aQGATACAC 1870 
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atattaaact ataccttata gtaaaccagt 
tttcttttcc aaaggttgtt gctctaott 

CTTTAAAGTC ATACCaGAGT GGCCAAGaGT 

ttcactcaca tttctggaac tagctatttt 
5 acaggctgta tttcctccca gcaaacagtt 
tacccctgga ttatagcaca tctcatgttt 
ttaaattcca gacaacaatg gaagcattgc 
gtttggagcc tggcacagtc ctccagcctg 

TGCTTTCTGG GAGCTATGTA OTCTTCaaT 
io aaatacttgg aagttacttt AAGAAAACCA 

GGCCGCTTTT GCTGGAATGC TCTAGGTTAT 
GAAAATTGGA AGMTGCAAA ATGGATCAGA 
TGTTTTATCA atatgattat CAAATCACAG 

tgtttttatt itatggctct cgccctaagc 
15 caaatggact acaagcacgt gtttgctgtc 
aatttgtaag gatattcaga tggagcactg 
cttgtctttc ttgagctttt tggaaggata 

CCACAGTGCT ttcttcaaat CATATGAGAA 
GCCAGGAAAA TTCTGAGTTC cagcacaatt 

20 aggaagaagg gaggtctcca tttctatgtc 
tTagcttggt gaaaaaaagt tcactgaaca 

TAGATGTTCC TTTTGTGAAG CACCTTGaTT 

aatggcacm agtcaaaatg AAATCAATGT 
GAATGATACa cccatatgct atatacagct 

25 AAATAATTCA ACATGTCCaT CTTTTTAGTG 
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ATCCCAAGCT GCTTTTAGTT CCAAAAATAG i930 

GTAGGAAGTC TTTGCATATG GCCCTCCCAA 1990 

GTTTATCCCA ACCCTTCCAT TTAACaGGAT 2050 

TCAGAAGACA ataatcaggg cttaattaga 2110 

GTGGCCACAC TAAAAACAAT CATAGCATTT 2170 

TATCATTTGG ATGGAGTAAT TTAAAATGAA 2230 

CTGGCAGATC TCACAACAGA ATAACCaCTT 2290 

ATCAAAAATT ATTCTGCaTA GTTTTCAGTG 2350 

TTGGAAACTT TTCTCTCTCA TTTATAGTGA 2410 

GTGTGGCCTT TTTCCCTCTA GCTTTAAAAG 2470 

AGATAAACAA TTAGGTATAA TAGCAAAAAT 2530 

ATCATGCCTT CCMTAAAGG CCTTTACaCa 2590 

CATATACAGA AAAGACTTGG AOTATTGTA 2650 

ACTTCTTTCT AAATGTATCG GaGaAAAAAT 2710 

CTTGCACCCC AGGTAAACCT GCATTGTAGC 2770 

TCACTTAGAC ATTCTCTGGG GGATTTTCTG 2830 

ATTCTGATAA GGCACTCAAG AMCGTACAA 2890 

ATACTATGCA TAGCAAGGAG ATGCAGAGCC 2950 

TTCTTTGGAA TCTAACAGGA ATCTAGCCTG 30 10 

TGGTATTTGG GGGTTTTGTT TGTTTTTGCT 3070 

CCAAGACCAG AATGGaTTTT TTTAAAAAAA 3130 

CCTTGATTTT GATTTTTTGC AAAGTTAGaC 3190 

TTAGTTCACA aGTAGATGTA ATTTACl'AAA 3250 

TAACTCaCaG AACTGTAAAA GAAAATTATA 3310 

ATAATAAAAG AAAGCATGGT A'fTAMCTAT 3370 
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CATAGAAGTA GACAGAAAAA GAAAAMGGA CTCaTGGCAT TATTaATATA ATTAGTGCTT 3430 
TACATGTGTT AGTTATACAT ATTAGAaGCA TATTTGCCTA GTAAGGCTAG TAGAACCaCA 3490 
TTTCCCAAAG TGTGCTCCTT AAACACTCAT GCCTTATGAT TTTCTACCAA MGTAAAAAG 3550 
GGTTGTATTA AGTCAGAGGA AGATGCCTCT CCATTTTCCC TCTCTTTATC AGAGGTTCaC 3610 
5 ATGCCTGTCT GCACATTAAA aGCTCTGGGA aGaCCTGTTC TAAAGGGACA AGTTGAGGTT 3670 
GTAAAATCTG CATTTAAATA AACATCTTTG ATCAC 3705 
[0058] 

Brief Description of the Drawings: 

Figure 1: A figure depicting the 

10 hydrophobicity/hydrophilicity profile of the protein 

encoded by clone HP01207. 

Figure 2: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 

encoded by clone HP01862. 
15 Figure 3: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 

encoded by clone HP10493. 
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Document Name: Abstract 
Abstract: 

Problems to be Solved: To provide human proteins having 
transmembrane domains, cDNAs coding for these proteins/ 
5 expression vectors of said cDNAs and eucaryotic cells 
expressing said cDNAs. 

Means to Solve the Problems: proteins containing any of 
the amino acid sequences represented by Sequence Nos. 1 to 
3, DNAs coding for these proteins, exemplified by cDNAs 

10 containing any of the base sequences represented by 
Sequence Nos, 4 to 6, expression vectors of said cDNAs, as 
well as eucaryotic cells expressing said cDNAs. Said 
proteins and eucaryotic cells having said proteins on the 
surface of membrane can be provided by expressing cDNAs 

15 encoding human proteins having transmembrane domains and 
recombinants of these human cDNAs. 
Selected Figure; None 
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Fig. 2 
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Fig. 3 
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